Programming Massively Parallel Processors: A Hands-on Approach: The Origins of Graphics Processing Unit (GPU) Technology

The birth of the graphics processing unit (GPU) was a radical shift driven by the Real-Time Imperative: the non-negotiable requirement to render a complex 3D scene within a window of just 1/60 of a second (16.67 milliseconds). CPUs, meanwhile, followed a multicore trajectory optimized for low-latency sequential work, and that path ran into limits as display resolutions increased.

1. The 16.67-Millisecond Constraint

In the mid-1990s, gaming reached a tipping point. A serial processor (the CPU), already responsible for AI and physics, could not compute millions of pixel values fast enough to keep motion smooth. This forced the creation of specialized hardware to offload the repetitive work of the graphics pipeline.
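The arithmetic behind this constraint can be sketched in a few lines. The function names below are hypothetical, and the 640x480 resolution is an illustrative assumption rather than a figure from this lesson; the point is only to show how small the per-pixel time slice becomes when pixels are computed one after another.

```python
def frame_budget_ms(fps: float) -> float:
    """Time available to produce one frame, in milliseconds."""
    return 1000.0 / fps


def per_pixel_budget_ns(fps: float, width: int, height: int) -> float:
    """Average time per pixel if pixels are computed strictly one at a time."""
    frame_ns = frame_budget_ms(fps) * 1_000_000  # milliseconds -> nanoseconds
    return frame_ns / (width * height)


if __name__ == "__main__":
    # 640x480 is an assumed example resolution, not taken from the lesson text.
    print(f"{frame_budget_ms(60):.2f} ms per frame")
    print(f"{per_pixel_budget_ns(60, 640, 480):.1f} ns per pixel")
```

At 60 FPS and the assumed 640x480 resolution, a strictly serial renderer would have roughly 54 nanoseconds per pixel, before any time is spent on AI or physics.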

2. Scan-Line Interleave (SLI)

Before parallelism moved inside the chip itself, 3dfx introduced Scan-Line Interleave (SLI). Two cards worked together, each rendering alternating horizontal lines of the frame. This shifted the industry's focus from single-thread speed to scaling aggregate throughput by adding more hardware.
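The interleaving idea can be illustrated with a small sketch. This is not a model of how 3dfx hardware was actually implemented; the function names and the `shade` callback are hypothetical, and the loop below runs the "cards" one after another, whereas real cards worked on their lines at the same time.

```python
def split_scan_lines(height, num_cards=2):
    """Assign row y to card (y % num_cards), giving each card alternating lines."""
    return [list(range(card, height, num_cards)) for card in range(num_cards)]


def render_interleaved(height, width, shade, num_cards=2):
    """Build a frame by letting each card fill in only its own scan lines."""
    frame = [None] * height
    for rows in split_scan_lines(height, num_cards):
        # Each card's rows are independent of every other card's rows,
        # which is why the work could be done concurrently in hardware.
        for y in rows:
            frame[y] = [shade(x, y) for x in range(width)]
    return frame


if __name__ == "__main__":
    print(split_scan_lines(6))  # card 0 gets even lines, card 1 gets odd lines
```

Because no scan line depends on another, each card handles about half of the rows, and the merged frame is identical to one rendered by a single card.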

3. Throughput vs. Latency

Early GPUs spent their silicon area on simple arithmetic units rather than on complex branch prediction. This "wide and slow" philosophy let GPUs handle the repetitive math of triangles, while CPUs concentrated on logic that cannot be parallelized.
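A toy cost model can make the trade-off concrete. The lane counts and per-item times below are hypothetical numbers chosen for illustration, not measurements of any real CPU or GPU, and the model assumes every item is independent.

```python
import math


def batch_time(items, lanes, time_per_item):
    """Time to finish `items` independent tasks on `lanes` parallel units.

    Work proceeds in rounds: each round handles up to `lanes` items and
    takes `time_per_item` time units.
    """
    return math.ceil(items / lanes) * time_per_item


if __name__ == "__main__":
    # Hypothetical designs: one fast lane vs. 64 lanes that are 4x slower each.
    for n in (1, 6400):
        narrow_fast = batch_time(n, lanes=1, time_per_item=1)
        wide_slow = batch_time(n, lanes=64, time_per_item=4)
        print(n, narrow_fast, wide_slow)
```

Under these assumed numbers, the narrow design wins on a single item (1 time unit vs. 4), which is the latency case, while the wide design wins by a large margin on 6,400 items (400 time units vs. 6,400), which is the throughput case.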

QUESTION 1

What is the specific 'time budget' required for 60 frames per second (FPS)?

33.33ms

16.67ms

10.00ms

100.00ms

QUESTION 2

How did 3dfx's SLI achieve early parallelism in consumer hardware?

By increasing the clock speed of a single chip.

By having two cards render alternating horizontal scan lines.

By sharing AI logic between the GPU and CPU.

By reducing the resolution of the frame.

QUESTION 3

Why did the GPU diverge from the standard multicore trajectory of CPUs?

GPUs needed deeper caches for complex branching.

GPUs prioritize throughput of simple math over low-latency serial logic.

CPUs became too expensive to manufacture for 3D graphics.

GPU architectures were designed to be smaller than CPUs.

QUESTION 4

In the context of 1990s gaming, what was the 'Real-Time Imperative'?

The requirement to run physics simulations on the GPU.

Processing millions of pixels within the strict frame window.

The transition from 16-bit to 32-bit computing.

Allowing the CPU to handle rasterization.

QUESTION 5

What is meant by the GPU's 'Wide and Slow' philosophy?

Using many simple processors at lower clock speeds to do massive work.

Designing physically wide chips that take longer to process data.

A design that favors high latency but high memory capacity.

Optimizing for single-threaded serial logic.